Personalizing XML Full Text Search in PIMENTO
نویسندگان
چکیده
XML search is increasing in popularity as more and larger XML repositories are becoming available. The accuracy of XML search varies across different systems and a lot of effort is put into designing scoring functions tailored to specific datasets. For example, the INEX effort [8] aims at improving the search relevance of IEEE XML data collections. To the best of our knowledge none of the existing XML search solutions incorporates user information to determine relevant query answers. In PIMENTO we argue that there is no scoring function that can meet all user-related information and advocate the idea of incorporating user profiles into XML search in order to customize query answers and improve search quality. Personalization is used in a variety of applications: in telecommunications it is used to direct user calls based on the caller context, in Web search the ranking of query answers may be modified using the user’s navigational and search patterns. In the relational database context, query personalization has been studied extensively [6,12] and shown to be effective in practice. In PIMENTO a user profile is composed of two kinds of preference rules: scoping rules and ordering rules. Scoping Rules are used to expand or restrict the original query result. Ordering Rules are combined with query scoring to customize the ranking of query answers, hence overriding the ranking strategy of the query engine. Query personalization in PIMENTO is defined as the process of rewriting a user query using scoping rules and ranking query answers using ordering rules. Enforcing scoping rules is not straightforward: there can be a large number of rewritings of the user query when scoping rules are considered. To enforce efficiently scoping rules we take into account existing query relaxation work [2,15]. A key contribution of our approach is that scoping rules can be incorporated into a single query plan without requiring actual query rewriting. Ultimately, the user is only interested in the top several answers. Consequently, understanding how to combine user profiles with topk processing is a key aspect of efficient query personalization. A core contribution of PIMENTO
منابع مشابه
Personalizing XML Text Search in PimenT
A growing number of text-rich XML repositories are being made available. As a result, more efforts have been deployed to provide XML fulltext search that combines querying structure with complex conditions on text ranging from simple keyword search to sophisticated proximity search composed with stemming and thesaurus. However, one of the key challenges in full-text search is to match users’ ex...
متن کاملFull Text Search in XML Documents
The goal of this paper is to show how XML structure information can be used for full text search in XML documents. Existing products for full text search are investigated regarding their support of XML. The main aspect of this investigation is how the search scope of queries is specified and narrowed by taking advantage of the XML format. Considering the results of this investigation, a suggest...
متن کاملA Method for Evaluating Full-text Search Queries in Native XML Databases
In this paper we consider the problem of efficiently producing results for full-text keyword search queries over XML documents. We describe full-text search query semantics and propose a method for efficient evaluation of keyword search queries with these semantics suitable for native XML databases. Method uses inverted file index which may be efficiently updated when a part of some XML documen...
متن کاملFull-Text and Structural Indexing of XML Documents on B+-Tree
XML query processing is one of the most active areas of database research. Although the main focus of past research has been the processing of structural XML queries, there are growing demands for a fulltext search for XML documents. In this paper, we propose XICS (XML Indices for Content and Structural search), which aims at high-speed processing of both full-text and structural queries in XML...
متن کاملFull-Text and Structural XML Indexing on B+-Tree
XML query processing is one of the most active areas of database research. Although the main focus of past research has been the processing of structural XML queries, there are growing demands for a full-text search for XML documents. In this paper, we propose XICS (XML Indices for Content and Structural search), novel indices built on a B-tree, for the fast processing of queries that involve s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008